VOICE OPERATED REMOTE CONTROL SYSTEM

Background of the Invention
1. Field of the Invention 
The present invention relates to systems for
controlling a device by using voice recognition of
spoken commands, and more particularly to systems of 
that type in which the commands are communicated to the 
controlled device by wireless transmission. 

2. History of the Prior Art

It is well known to provide electronic appliances, 
particularly video devices such as television sets and 
video cassette recorders (VCRs), which are supplied
with wireless remote controls. The wireless remote
controls transmit signals to the controlled appliance
or other device in response to the pressing of 
appropriate buttons by a user. The controlled devices 
receive such signals, decode them and perform a 
function based on the specific signal received. The
transmitted signals are usually digital in nature, and
typically are transmitted using infrared light 
transmission. Alternatively, radio frequency 
transmission is used in some systems. 

Recent developments have led to voice activated
remote controls. Such remote controls utilize
sophisticated electronics to recognize spoken commands,
translate the commands into the traditional digital 
remote control signals, and transmit the control 
signals to the controlled device. Examples of such
systems are provided by co-pending application Serial
No. 07/915,112 of Bissonnette et al., entitled Voice
Operated Remote Control Device, by co-pending
application Serial No. 07/915,938 of Bissonnette et
al., entitled Voice Recognition Apparatus and Method,
and by co-pending application Serial No. 07/915,114 of
Fischer, entitled Remote Control Device. All three
applications were filed on July 17, 1992 and are
commonly assigned with the present application.

Co-pending application Serial No. 07/915,112 
describes a system utilizing a voice operated, hand-held,
battery powered portable remote control device
for controlling remote electronic devices by the 
transmission of infrared control signals. The voice 
operated remote control device accepts voice commands 
spoken by the user, performs voice recognition pattern
matching on the spoken word by comparing the same
against pretrained templates to determine the 
appropriate corresponding command, determines the 
specific infrared remote control code or set of codes 
that represent the function corresponding to the
command, and transmits the infrared remote control
codes to the electronic device to be controlled. The
voice operated remote control device can be operated by 
multiple users, either by voice or by manual control of 
the selected function on the remote control device. 

The voice commands can represent either real time
commands, which are transmitted immediately after the
determination of the voice command, or programmed delay
events which delay the transmission of the remote
control codes or set of remote control codes for a
preset period of time specified by the user.

Co-pending application Serial No. 07/915,938
describes a method and apparatus for improving the 
quality of voice templates for speaker dependent voice 
recognition devices, through the use of two word
templates for each word in a vocabulary and
qualification of the word templates. By comparing two
word templates corresponding to the same word to each 
other, the apparatus takes the result of the comparison 
and performs an evaluation to determine the relative
quality of the word templates. If word template
quality is deemed to be less than an acceptable level,
the apparatus requests the user to retrain the specific
word in order to improve the recognition accuracy rate
of the apparatus. Both resulting templates are later 
used during voice recognition. The system includes a 
noise rejection system using noise templates during a 
pattern matching process to significantly reduce the
required processing power over commonly used digital 
noise filters. The system further includes means for 
correcting voice recognition errors through use of a 
voice command. Correcting errors by means of voice
commands enables the user to correct errors while
speaking into the apparatus, without interrupting the
flow of the voice process. The fact that an error 
occurred on a particular word within a particular word 
group is used to assist in replacing the word template. 

Co-pending application Serial No. 07/915,114
describes a method and apparatus whereby a remote
control device executes system control functions in 
response to a single user command request. The 
apparatus self-configures based on the configuration of
the equipment to be used. System control functions,
which can consist of one or more controls from a single
remote control or from multiple remote controls, are 
controlled by transmitting a string or sequence of 
commands from the apparatus to single or to multiple
pieces of equipment for control of that equipment. The
system control functions are selected by the user by
activation of a single key representing the specific
function. The apparatus self-configures, determining
what system functions are applicable based on the
equipment which the user has and based on the functions
learned from the user's remote control.

The approach exemplified by the three co-pending 
applications identified above allows the use of voice 
recognition technology with controlled devices that
were not designed for such technology. The voice
remote control mimics the traditional remote control in
the way in which it communicates with the controlled
device. Unfortunately, this requires the addition of a
significantly higher level of technology in the remote 
control device itself, much of which is already present 
in the controlled device. 

For example, virtually all television sets and VCRs
utilize 8-bit microcontrollers with on-board ROM and
RAM memory capacity. Such devices often include some
type of display. Many newer television sets have
on-screen displays. All such devices have power supplies
for powering the electronics. A voice activated remote
control must duplicate all such components.

Moreover, in order to mimic the digital remote 
control signals, such voice activated remote control 
devices must have the ability to either "learn" the
control signals or have a substantial library of
existing codes built into the ROM thereof. The only
components added to the voice activated remote control
device that are not duplicated in the controlled device
or that are added only for the purpose of mimicking the
standard remote controls, are a microphone, audio
amplifier/filter circuit and an analog-to-digital
converter.

If the controlled device were to be capable of 
supporting voice control itself, then the remote
control device would only have to transmit the voice
signal without the need for the sophisticated voice
activated remote controls. In such an arrangement, the
remote control device would only have to convert the
voice signal from the microphone into a format that can
be transmitted. The result would be a relatively
simple remote control device in which multiple buttons
and the microcontroller used to interpret button 
operation would not be needed. 

Accordingly, it would be desirable to provide a
voice operated remote control system in which a
relatively simple and inexpensive remote control device
is used to transmit voice commands spoken by the user
to the controlled device, in wireless fashion. The
controlled device would desirably include the voice
recognition apparatus, so as to utilize the existing
microprocessing and memory capacities of the controlled
device with only minor additions thereto and so as to
greatly simplify the remote control device.

Brief Description of the Invention 
The present invention provides a voice operated 
remote control system in which a remote control device
responds to voice commands from the user to transmit
representations of the voice commands to a controlled 
device, in wireless fashion. The controlled device 
responds to the transmitted representations by 
performing a voice recognition process in an attempt to
recognize the voice commands of the user. Recognized
voice commands are used to produce action routines for 
specific functions to be performed by the controlled 
device. The incorporation of voice recognition 
apparatus within the controlled device enables the
remote control device to be of simple and inexpensive
design. The remote control device need only include 
apparatus such as a microphone for converting the voice 
commands into analog audio signals and apparatus for 
modulating or otherwise processing such signals for
transmission to the controlled device.

In a preferred embodiment of a voice operated 
remote control system in accordance with the invention,
an audio wireless remote control device includes a
microphone for converting voice commands into analog
audio signals. Following amplification of such signals
by an audio amplifier, the signals are applied to a 
voltage controlled oscillator or other appropriate 
circuit for frequency modulation, prior to application 
to a driver and an infrared transmitter. The infrared
transmitter transmits the frequency modulated audio
signal to the remotely located controlled device, in
wireless fashion. The remote control device is battery
powered and includes a push-to-talk switch for
conserving battery power by turning on the circuitry
only when the user desires to enter a voice command
into the remote control device.

An audio receiver located within the controlled
device includes an infrared sensor for sensing the
transmitted signals from the remote control device. 
The sensed signals are amplified and filtered prior to 
application to a phase-locked loop receiver for
demodulation of the frequency modulated signal. The
demodulated output of the phase-locked loop receiver is
filtered and amplified to provide a voice signal. The
phase-locked loop receiver also provides a lock detect
output to indicate when a transmitted signal has been
received.

The voice signals from the audio receiver are 
applied to voice recognition apparatus within the 
controlled device. The voice recognition apparatus 
comprises a microcontroller which may include an
analog-to-digital converter, together with a
microprocessor and ROM and RAM memories. The
analog-to-digital converter converts each voice signal
produced by the audio receiver into an incoming digital 
voice signal. The memories include a reference memory
for storing a plurality of reference digital voice
templates, and a program memory for storing a control
program. The microprocessor generates an incoming 
digital voice template from the incoming digital voice 
signal, executes the control program to determine
whether the incoming digital voice template is
substantially equivalent to one of the reference
digital voice templates, and determines what action to 
take based on the incoming digital voice template. 
Typically, one or more action routines are carried out
by the microprocessor to accomplish the action or
actions dictated by the voice commands of the user.

The controlled device may comprise any appliance or
other device capable of being electronically
controlled. An example of such a device is a
television set having functions such as on/off, channel
selection, volume and mute which are controlled in
response to voice commands from the user entered at the
remote control device. The lock detect signal from the 
phase-locked loop receiver within the audio receiver 
may be used with such device to determine when a voice 
command is no longer being transmitted from the remote
control device. 

Brief Description of the Drawings 
A better understanding of the invention may be had 
by reference to the following detailed description 
taken in conjunction with the accompanying drawings, in
which: 

Fig. 1 is a basic block diagram of a voice operated 
remote control system in accordance with the invention;

Fig. 2 is a detailed block diagram of the remote
control device of Fig. 1;

Fig. 3 is a detailed block diagram of the audio 
receiver of Fig. 1; 

Fig. 4 is a detailed block diagram of the 
controlled device of Fig. 1;

Fig. 5 is a basic flow chart illustrating the
operation of the system of Fig. 1; and

Fig. 6 is a detailed flow chart illustrating the 
operation of the system of Fig. 1 in connection with 
the television set shown in Fig. 4. 

Detailed Description

Fig. 1 shows a voice operated remote control system 
10 in accordance with the invention. The remote 
control system 10 of Fig. 1 includes a remote control 
device 12 for transmitting voice signals to a remotely 
located controlled device 14 in response to voice
commands. The remote control device 12 includes a
microphone 16 for receiving voice commands from the
user in the form of spoken words. The microphone 16
converts the spoken words into analog audio signals for
application to an audio wireless remote control circuit
18. The audio wireless remote control circuit 18
processes the analog audio signals from the microphone
16 into an appropriate form for transmission to the
controlled device 14, using an infrared transmitter 20.

As described hereafter in connection with Figs. 2 
and 3, the audio wireless remote control circuit 18
includes circuitry for frequency modulating (FM) the
analog audio signal prior to application to a
transmitting device in the form of the infrared
transmitter 20. The infrared transmitter 20 transmits
the voice commands from the user to the controlled
device 14 in wireless fashion.

While the voice operated remote control system 10 
is described herein in connection with frequency 
modulation and demodulation of the voice commands and 
transmission in the form of an infrared signal, it
should be understood that other signal processing and
transmission techniques can be used. For example,
forms of modulation other than frequency modulation,
such as amplitude modulation (AM), can be used to
produce a representation of the analog audio signal
appropriate for transmission. Also, modulated or
otherwise modified signals can be transmitted in other
than infrared signal form, such as in the form of a 
radio frequency signal. 

The remote control device 12 comprises a small,
portable, hand-held unit into which the user speaks the
voice commands. The remote control device 12 is
remotely located relative to the controlled device 14,
but nevertheless is close enough to the controlled
device 14 so that the voice signals transmitted by the
remote control device 12 are received by the controlled
device 14.

As described in detail hereafter, the controlled
device 14 comprises a television set, in the present 
example. However, it should be understood that the 
controlled device 14 can comprise any appliance or 
device capable of being controlled electronically. For
example, the controlled device may comprise a video 
cassette recorder (VCR) in which the spoken commands 
dictate the date, time and channel selection for future 
recording by the VCR, or simple commands such as play
or rewind. Still other examples of appliances or
devices which may comprise the controlled device 14
include kitchen appliances such as ovens and ranges,
other household appliances, security systems,
automotive accessories and office business equipment.

Referring again to Fig. 1, the voice signals
transmitted by the remote control device 12 are
received by the controlled device 14. The controlled 
device 14 includes an infrared sensor 22 for sensing 
the infrared signals transmitted by the infrared
transmitter 20. The sensed signals are processed by an
audio receiver 24 which demodulates the transmitted 
signal as well as amplifying and filtering such signal 
to provide to a voice recognition circuit 26 the voice 
signal represented thereby. The voice recognition
circuit 26 performs a voice recognition process in
which the voice signal is compared with pre-stored
representations of different words of a vocabulary. If
substantial similarity is found, then an action routine
corresponding to the pre-stored word found to be
substantially similar to the voice signal is provided
to the controlled device 14 by the voice recognition 
circuit 26, so that a desired function is performed 
within the controlled device 14. 

A detailed example of the remote control device 12
of Fig. 1 is shown in Fig. 2. As described above in
connection with Fig. 1, the microphone 16 responds to a
word spoken by the user by producing an analog audio
signal. Such signal is amplified in an audio amplifier
28. The audio amplifier 28 conditions the signal for
proper application to a voltage controlled oscillator
(VCO) 30 to frequency modulate (FM) a signal at the
output of the VCO 30 in accordance with the analog
audio signal from the amplifier 28. The output of the
VCO 30 is conditioned by a driver circuit 32 prior to 
driving the infrared transmitter 20. 

The remote control device 12 is powered by
batteries 34. Because there is no memory or
microprocessor in the remote control device 12, there
is no need to keep the remote control device 12 powered
when not in use. Accordingly, a push-to-talk switch 36
is used to provide the power from the batteries 34 only
when the user wishes to speak into the remote control
device 12. Pushing a button on the outside of the
remote control device 12 closes the push-to-talk switch
36 to couple the batteries 34 to a voltage regulator
38. The voltage regulator 38, which is coupled between
a common ground line 40 and the push-to-talk switch 36,
has an output terminal 42 thereof coupled to provide a
supply voltage (+V) at a terminal 44. This regulated
supply voltage is applied to the microphone 16, the
audio amplifier 28 and the voltage controlled
oscillator 30. The batteries 34 are directly coupled
to the driver 32 by the push-to-talk switch 36. The
regulated supply voltage +V optimizes the operating
conditions of the voltage controlled oscillator 30.

Whenever the user wishes to transmit a voice
command to the controlled device 14, the user pushes
the button to close the push-to-talk switch 36 and
speaks the voice command into the microphone 16. The
resulting analog audio signal is amplified by the
amplifier 28 and applied to the VCO 30 to frequency
modulate the output of the VCO 30. This FM signal is
applied via the driver 32 to the infrared transmitter
20, for transmission as an infrared signal to the
controlled device 14. When the user is not
transmitting voice commands, the push-to-talk switch 36
is open, and the batteries 34 are not coupled to the
voltage regulator 38 or to the driver 32.
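
The frequency modulation performed by the VCO 30 can be illustrated
numerically. The following is a minimal sketch only and not the circuit
of Fig. 2: the carrier frequency, frequency deviation and sample rate
are assumed values chosen for illustration, and the function name
fm_modulate is hypothetical.

```python
import math


def fm_modulate(audio, carrier_hz=40_000.0, deviation_hz=5_000.0,
                sample_rate=400_000.0):
    """Frequency modulate a carrier with audio samples in the range -1..1.

    All numeric defaults are illustrative assumptions, not values
    taken from the text.
    """
    phase = 0.0
    out = []
    for sample in audio:
        # The instantaneous frequency follows the audio amplitude, much as
        # a VCO's output frequency follows its control voltage.
        freq = carrier_hz + deviation_hz * sample
        phase += 2.0 * math.pi * freq / sample_rate
        out.append(math.cos(phase))
    return out
```

With a silent input the output is simply the unmodulated carrier; a
positive audio sample advances the phase faster, and a negative sample
advances it more slowly.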

When the user pushes the button to close the
push-to-talk switch 36 and speaks a command word into the
microphone 16, the remote control device 12 transmits a
frequency modulated infrared signal from the infrared
transmitter 20. The infrared transmitter 20 may
comprise an infrared diode or other appropriate
infrared transmitting device. Such infrared signals
are sensed by the infrared sensor 22 and applied to the
audio receiver 24. A detailed example of the audio
receiver 24 is shown in Fig. 3.

Referring to Fig. 3, the infrared sensor 22, which
may comprise any appropriate form of infrared sensor
such as those typically used in remote control devices,
provides the received signal for amplification by an
amplifier 46 and filtering by a filter 48. The output
of the filter 48 is applied to a phase-locked loop
receiver 50 which demodulates the frequency modulated
signal by converting it back to a voice signal. The
phase-locked loop receiver 50 is of conventional
phase-locked loop (PLL) configuration, and includes a phase
detector 52 coupled to the filter 48 and having an
output and a second input coupled in a loop which
includes a loop filter 54 and a voltage controlled
oscillator (VCO) 56. The output of the phase-locked
loop receiver 50 is filtered by an audio filter 58 and
amplified by an audio amplifier 60 to provide the
demodulated voice signal. The phase-locked loop 
receiver 50 also provides a lock detect signal, which 
indicates when a transmitted signal is present in the 
audio receiver 24. 

The phase-locked loop receiver 50 is described
herein by way of example only, and it should be
understood that other demodulation receivers can be
used as desired. For example, a super-heterodyne
receiver can be used in applications posing more 
stringent requirements. 

Fig. 4 shows the voice recognition circuit 26 of 
Fig. 1 in greater detail in conjunction with a
television set 62. In the present example, the
television set 62 comprises the appliance or device
being controlled by the voice commands from the remote
control device 12. The television set 62 is of
conventional configuration and includes a cathode ray
tube (CRT) 64 for providing the TV screen. The CRT 64
is driven by a tuner 66 through a video circuit 68. An
on-screen display circuit 70, which is coupled to the
video circuit 68, provides additional screen displays
such as menus of the type which invite the viewer to
make selections by speaking further voice commands into
the remote control device 12. As described hereafter,
the on-screen display circuit 70 is used to provide
on-screen displays requesting the user of the remote
control device 12 to enter further voice commands. The
television set 62 also includes an audio circuit 72 
which is coupled to speakers 74 for the television set 
62. 

The voice recognition circuit 26 comprises a 
microcontroller which includes an analog-to-digital
converter (A/D) 76 coupled to receive the voice signal
from the audio receiver 24. The microcontroller of the
voice recognition circuit 26 also includes a
microprocessor (MP) 78 coupled to the A/D converter 76,
and having access to both a read only memory (ROM) 80
and a random access memory (RAM) 82. 

The microcontroller comprising the voice 
recognition circuit 26 is coupled to a digital IR input 
84 and to an external DRAM 86. With the exception of 
the external DRAM 86 and the audio receiver 24, the
remaining components shown in Fig. 4, including those
within the microcontroller of the voice recognition
circuit 26, are standard components found in most
modern television sets. The external DRAM 86 is
utilized for voice recognition storage requirements,
primarily storage of voice templates. Therefore, the
conventional television set 62 is easily modified by 
addition of the audio receiver 24 and the external DRAM 
86 in order to make it responsive to voice commands 
transmitted by the remote control device 12. Again, 
the microcontroller comprising the voice recognition 
circuit 26 is part of the existing circuitry within the 
conventional television set 62. Accordingly, voice 
operated remote control systems 10 in accordance with 
the invention do not require the addition of a 
microprocessor but only the modification of an existing 
one within the controlled device 14, principally 
through the addition of the external DRAM 86. 

The microcontroller comprising the voice 
recognition circuit 26 operates in the same manner as 
described in the previously referred to co-pending 
applications, Serial Nos. 07/915,112, 07/915,938 and 
07/915,114. Such applications are incorporated herein 
by reference. As described in detail in co-pending 
application Serial No. 07/915,112, for example, the A/D
converter 76 of Fig. 4 may comprise an 8-bit converter
which samples incoming data at a preassigned frequency
such as 9.6 kHz. The A/D converter 76 outputs a
digital signal representing the input analog voice 
signal from the audio receiver 24. The microprocessor 
78 processes the digital voice signal together with a 
voice recognition software routine forming part of a 
control program stored in the ROM 80. The digital 
voice signal is converted into an incoming voice 
template that is compared against previously stored 
voice templates of the user's voice, stored in the 
external DRAM 86. The program decodes the voice 
templates. Together with the external DRAM 86, the RAM
82 comprises a reference memory for temporary storage
of data. 
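
The conversion performed by the A/D converter 76 can be sketched as
follows. This is an illustration under stated assumptions rather than a
description of any particular converter: the 8-bit resolution and the
9.6 kHz sampling frequency are the example values given above, whereas
the 0 V to 5 V input range and the function name quantize are assumed.

```python
SAMPLE_RATE_HZ = 9600  # example sampling frequency from the text
BITS = 8               # example converter resolution from the text


def quantize(voltage, v_min=0.0, v_max=5.0):
    """Map an analog voltage to an unsigned 8-bit code (0..255).

    The 0 V to 5 V input range is an assumption; the text does not
    state one. Out-of-range inputs are clamped.
    """
    levels = (1 << BITS) - 1
    clamped = min(max(voltage, v_min), v_max)
    return round((clamped - v_min) / (v_max - v_min) * levels)
```

At these example values, one second of speech yields 9600 one-byte
samples for the microprocessor 78 to reduce to a voice template.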

Thus, the analog voice signal from the audio 
receiver 24 is applied to the A/D converter 76 for 
conversion into an incoming digital voice signal. The
reference memory, comprised of the external DRAM 86 in 
conjunction with the RAM 82, stores a plurality of 
reference digital voice templates. The ROM 80 stores 
the control program. The microprocessor 78, which is
coupled to the A/D converter 76, the ROM 80 and the RAM
82, generates an incoming digital voice template from
the incoming digital voice signal at the output of the
A/D converter 76. The microprocessor 78 then executes
the control program to determine whether the incoming
digital voice template is substantially equivalent to
one of the reference digital voice templates, stored in
the reference memory comprised of the RAM 82 and the
external DRAM 86. The microprocessor 78 determines
what action to take corresponding to a reference
digital voice template, if the incoming digital voice
template is found to have substantial similarity to the
reference digital voice template. The microcontroller
causes the appropriate components of the television
set to perform the desired function within the
controlled device 14.
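
The comparison of an incoming template against the stored reference
templates can be sketched as shown below. The text does not specify how
"substantial similarity" is measured, so the sum of absolute
differences, the threshold parameter and the names distance and
recognize are all assumptions made for illustration only.

```python
def distance(a, b):
    """Sum of absolute differences between two equal-length templates.

    This metric is an assumption; the text does not name one.
    """
    return sum(abs(x - y) for x, y in zip(a, b))


def recognize(incoming, references, threshold):
    """Return the word whose reference template is closest to the
    incoming template, or None if no template is within the threshold."""
    best_word, best_score = None, None
    for word, template in references.items():
        score = distance(incoming, template)
        if best_score is None or score < best_score:
            best_word, best_score = word, score
    if best_score is not None and best_score <= threshold:
        return best_word
    return None
```

A result of None corresponds to the case in which no reference template
is substantially similar, so that no action routine is selected.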

Voice control is made possible, in the voice 
operated remote control system 10, by first voice 
training the collection of reference digital voice 
templates in accordance with the user's voice. Such
templates are collected in the same manner as described
in co-pending application Serial No. 07/915,112. In
the example of Fig. 4, however, the screen of the
television set 62 (CRT 64) is used to provide an
on-screen display which prompts the user by requesting the
needed words. The user responds by pressing the button
to close the push-to-talk switch 36 and speaking the
prompted word into the microphone 16 of the remote
control device 12. 

When voice training is complete, the voice operated 
remote control system 10 is ready for use. The basic 
flow chart of Fig. 5 illustrates the manner in which 
the controlled device 14 operates in conjunction with 
the remote control device 12. To begin with, the 
controlled device 14 is in an idle mode 88, waiting for 
commands. When the audio receiver 24 detects a signal 
and locks on to accomplish demodulation, the lock 
detect signal causes an exit from the idle mode 88, 
with a visual cue to be presented to the user. As 
shown in Fig. 5, the absence (No) of a lock detect 90 
causes the system to return to the idle mode 88. 
However, when a lock detect (Yes) is present, visual 
feedback 92 is provided to the user. This comes in the
form of an on-screen display. The user responds by 
speaking the desired command or string of commands, and 
voice recognition 94 thereof produces selected action 
96 within the controlled device 14 through execution of 
an action routine by the microprocessor 78. Afterward,
the system returns to the idle mode 88.
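
One pass through the flow of Fig. 5 can be sketched as shown below. The
function and parameter names are hypothetical, and the recognition and
display steps are passed in as callables so that the sketch makes no
assumption about how they are implemented.

```python
def run_cycle(lock_detected, recognize, actions, show_cue):
    """One pass through the Fig. 5 flow.

    lock_detected: bool taken from the receiver's lock detect output.
    recognize: hypothetical callable returning a command word or None.
    actions: dict mapping command words to action routines (callables).
    show_cue: hypothetical callable that presents the on-screen cue.
    Returns the command word acted upon, or None if the system simply
    returns to the idle mode.
    """
    if not lock_detected:
        return None        # no lock detect 90: remain in idle mode 88
    show_cue()             # visual feedback 92
    word = recognize()     # voice recognition 94
    if word in actions:
        actions[word]()    # selected action 96
        return word
    return None            # nothing recognized: back to idle mode 88
```

In this sketch an unrecognized utterance is treated the same way as the
absence of a lock detect, which is an assumption; the text describes
only the path in which a command is recognized.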

Fig. 6 comprises a detailed flow chart illustrating 
the operation of the circuit of Fig. 4 in conjunction 
with the remote control device 12. As in the case of 
the flow chart of Fig. 5, the starting point in the 
flow chart of Fig. 6 is an idle mode 98. When a lock 
detect 100 is not present (No), the system always
returns to the idle mode 98. When a lock detect 100 is
present (Yes), the idle mode 98 is exited and a check
is made to determine if the television set 62 is turned
on ("TV on" block 102). If the television set 62 is
not turned on (No), then the set is turned on ("turn
on" block 104). If the television set 62 is turned on
(Yes), then the system proceeds to an initial visual
cue ("cue" block 106). This feature allows the user to
turn on the television set 62 without using a voice
command or a separate button therefor on the remote
control device 12. With on-screen display, the visual 
cue 106 can simply be a symbol indicative of speech. 
Because the controlled device 14 is designed with voice 
recognition in mind, it is also possible for the volume
of the television set 62 to automatically be decreased
to a moderate level to facilitate better voice
recognition, at this point. Once the visual cue 106 is
displayed, the user knows that the television set 62 is
"listening".

At this point, the user has a choice of turning the 
television set 62 off ("Off"), changing the channel
("Channel"), adjusting the volume ("Volume"), or muting
the audio ("Mute"), as represented by a function block
108 in Fig. 6. When a voice command word is
recognized, the on-screen display shows the word
recognized or an appropriate symbol indicative of the 
word or the action denoted thereby. 

If the user says the word "Off", then the
television set 62 is turned off ("turn off" block 110)
and the system returns to the idle mode 98.

If the user says the word "Channel", then the word 
"Channel" is displayed on the television screen and a 
second group of words is possible. This includes the
possibility of either "Up" or "Down" commands, or entry
of the digits 0 through 9. "Up" or "Down" commands
result in changing the channel by one channel and
returning to the same choices. This allows the user to
single-step through the channels by speaking the words
"Up" or "Down" several times. If a channel number is
spoken and recognized, then the user can either release
the push-to-talk switch 36, which causes the channel to
change to the appropriate single-digit channel, or the
user can speak another digit causing the channel to be
changed to the appropriate 2-digit channel.

This is shown in Fig. 6 by a block 112 which
represents the choices of entering a particular channel
number (0-9) or an "Up" or "Down" command. Again, an
indication of lock lost results in returning to the
idle mode 98. If the user decides to single-step
through the channels by saying "Up" or "Down" several
times, then the system increments in the "Up" or "Down"
direction, as represented by a "channel up" block 114
or a "channel down" block 116, respectively.
Alternatively, the user can speak a number in the 0-9
range. If the spoken number is recognized, the user
can release the push-to-talk switch 36 and this causes
the channel to be changed to the appropriate
single-digit channel, as represented by a block 118 shown in
Fig. 6. Alternatively, the user can continue pushing
the push-to-talk switch 36 and speak a second digit,
causing the channel to be changed to the appropriate
2-digit channel, as represented by a block 120 in Fig. 6.
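
By way of illustration only, the channel word group of block 112 can be sketched in Python as follows. This is not the control program of the described embodiment. The function name `select_channel`, the representation of recognized words as a list that ends when the push-to-talk switch is released, and the absence of any channel limits are all assumptions made for the sketch.

```python
DIGITS = set("0123456789")


def select_channel(current, words):
    """Return the channel selected by the words recognized while the
    push-to-talk switch is held (hypothetical helper, not from the source).

    current -- channel number before the "Channel" command
    words   -- recognized words, in order; the list ends at switch release
    """
    digits = ""
    for word in words:
        if word in DIGITS:
            digits += word              # first digit (block 118) or second (block 120)
            if len(digits) == 2:
                break                   # a 2-digit channel is complete
        elif not digits and word == "Up":
            current += 1                # "channel up" block 114
        elif not digits and word == "Down":
            current -= 1                # "channel down" block 116
        # Any other word is ignored in this sketch (assumption).
    return int(digits) if digits else current
```

For example, `select_channel(5, ["Up", "Up"])` steps from channel 5 to channel 7, while `select_channel(5, ["1", "2"])` selects channel 12.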

If the user speaks the word "Volume", then the word
"Volume" is displayed together with a visual indication
of the present volume on the screen of the television
set 62, and a second word consisting of either "Up" or
"Down" is accessed, as represented by a block 122 in
Fig. 6. Speaking the word "Up" causes the volume to be
increased by one level, as represented by a block 124.
Conversely, speaking the word "Down" causes the volume
to decrease by one level, as represented by a block
126.

If the user speaks the word "Mute", the
microcontroller checks the state of the audio output,
in a "muted" block 126. If the audio output is on
(No), then the audio signal is turned off, as
represented by a mute sound block 128. Conversely, if
the audio output is off (Yes), then the audio output is
turned back on to the level where it was prior to being
muted ("enable sound" block 130).
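
The volume and mute behavior described above can likewise be sketched in Python, purely as an illustration. The class name `AudioState`, the starting level of 10, and the lack of upper and lower volume limits are assumptions of the sketch and are not taken from the described embodiment.

```python
from dataclasses import dataclass


@dataclass
class AudioState:
    """Hypothetical model of the audio output of the television set."""
    level: int = 10       # assumed starting volume level
    muted: bool = False

    def volume(self, word):
        """Handle the second word of the "Volume" group."""
        if word == "Up":
            self.level += 1     # block 124: increase by one level
        elif word == "Down":
            self.level -= 1     # decrease by one level

    def mute(self):
        """Handle "Mute": check the muted state and toggle it."""
        # The stored level is left untouched, so enabling sound again
        # restores the level that was in effect before muting.
        self.muted = not self.muted

    @property
    def output_level(self):
        """Level actually heard: zero while muted."""
        return 0 if self.muted else self.level
```

Because muting only sets a flag and leaves `level` unchanged, a second "Mute" command restores the earlier level without any separate bookkeeping.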

As previously discussed, the voice operated remote
control system 10 in accordance with the invention is
described herein in terms of transmission of an
infrared signal for purposes of example only. In
actual practice, other transmission media such as FM or
AM radio frequency (RF) signals can be used. VCRs,
audio tape and disc players, and radio receivers can
all be controlled using the techniques described 
herein. Also, other appliances or devices such as 
household appliances, automotive accessories, home 
security systems, and office equipment can be 
controlled in accordance with the techniques described 
herein. 

In accordance with the invention, multiple users 
can utilize a single remote control device 12. The
voice of each user is trained, and the templates which 
result therefrom are stored in the controlled device 14 
separately from templates of the other users. 

In a first method for accommodating multiple users,
each user speaks his or her name at the appropriate
word group, and this causes the controlled device 14 to
change to the new user's voice templates. Once the
name is recognized, the on-screen display shows the
user's name, and the controlled device 14 uses that
user's templates until the name is changed. Even if the
controlled device 14 is turned off, it retains the
current user identification and indicates which user it
has retained when the system is turned on. This method
has the advantage that the remote control device 12
does not need any added components to support multiple
voices. In fact, the number of user voices supported
is limited in practice only by the choice of the system
software. Also, this method does not require that the
user remember a switch number or other identification
corresponding to his or her voice. The disadvantages of
this method are the need to add more words to the word
groups and possible confusion as to whose voice
templates are presently being used.
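
As an illustration only, the first method can be sketched in Python as a per-user template store held in the controlled device. The class name `TemplateStore`, its method names, and the representation of a template as a plain list are assumptions of the sketch. Retention of the current user while the device is turned off is not modelled.

```python
class TemplateStore:
    """Hypothetical store keeping each user's templates separately."""

    def __init__(self):
        self.templates = {}        # user name -> {command word: template}
        self.current_user = None

    def train(self, user, word, template):
        """Record the template produced when `user` trains `word`."""
        self.templates.setdefault(user, {})[word] = template

    def on_recognized_word(self, word):
        """Switch to a user's templates if the word is that user's name.

        Returns True when the active user changed to `word`.
        """
        if word in self.templates:
            self.current_user = word
            return True
        return False

    def active_templates(self):
        """Templates used for recognition until the name is changed."""
        return self.templates.get(self.current_user, {})
```

In this sketch the number of users is bounded only by available memory, which mirrors the point that the limit is set by the system software rather than by the remote control device 12.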

In accordance with a second method of accommodating 
multiple users, a second, multiple position switch is 
incorporated into the remote control device 12. When
the position of the switch is changed by the user, a 
series of pulses is transmitted to the controlled 
device 14 for interpretation to identify the user. The 
controlled device 14 then changes to the new user's 
templates.
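
The encoding of the series of pulses is not specified herein. Purely as a hypothetical illustration, the Python sketch below assumes that the number of pulses received equals the switch position, counted from one. The function name `user_from_pulses` and this encoding are assumptions, not part of the described embodiment.

```python
def user_from_pulses(pulse_count, users):
    """Map a received pulse count to a user (hypothetical encoding).

    pulse_count -- number of pulses received, assumed to equal the
                   switch position, counted from 1
    users       -- users in switch-position order
    Returns None when the count matches no switch position.
    """
    if 1 <= pulse_count <= len(users):
        return users[pulse_count - 1]
    return None
```

With three users assigned to positions 1 through 3, a burst of two pulses would select the second user, and any other count would leave the current templates unchanged.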

In situations where multiple voice operated remote
control systems 10 are being simultaneously used, it is
necessary to be able to distinguish the transmitted
signals from the different systems so that confusion
does not occur. One solution to this problem is to use
multiple frequencies. Because the receiving circuitry
is frequency selective, multiple communication channels
can be supported. It is best to design the controlled
device 14 with the ability to have the channel thereof
changed by the user with a switch or other means. In
that instance, the remote controls can be simple,
fixed-channel devices.

While the invention has been particularly shown and
described with reference to preferred embodiments
thereof, it will be understood by those skilled in the
art that various changes in form and details may be made
therein without departing from the spirit and scope of
the invention.

IN THE CLAIMS:

1. A voice operated remote control system
comprising the combination of:

a remote control device responsive to a voice
command for transmitting a representation of the voice
command; and

a control device including means responsive to the
transmitted representation of the voice command for
producing a voice signal corresponding to the
transmitted representation and voice recognition means
for recognizing the voice signal and producing an
action routine denoted thereby.

2. A voice operated remote control system in
accordance with claim 1, wherein the remote control
device includes means for converting the voice command
into an analog signal and means for modulating the
analog signal to produce a transmission signal, and the
means for producing a voice signal includes means for
demodulating the transmission signal.

3. A voice operated remote control system in
accordance with claim 2, wherein the means for
modulating performs FM modulation of the analog signal
to produce a transmission signal and the means for
demodulating performs FM demodulation of the
transmission signal.

4. A voice operated remote control system in
accordance with claim 1, wherein the voice recognition
means includes a reference memory for storing a
plurality of reference voice templates, a program
memory for storing a control program, and a processor
coupled to the reference memory and the program memory
for generating an incoming voice template in response
to each voice signal produced by the means for
producing a voice signal corresponding to the
transmitted representation, and for executing the
control program to determine whether the incoming voice
template is substantially equivalent to one of the
reference voice templates, and for selecting one of a
plurality of action routines based on the incoming
voice template.

5. A voice operated remote control system in
accordance with claim 4, wherein the voice signal is an
analog signal, and the voice recognition means further
includes an analog-to-digital converter for converting
each analog voice signal into an incoming digital voice
signal, and the reference memory stores a plurality of
reference digital voice templates.

6. A method of controlling by voice command a
controlled device, comprising the steps of:

responding to a voice command at a location remote
from the controlled device by converting the voice
command and transmitting the converted voice command;
and

responding to a transmitted converted voice command
at the controlled device by converting the transmitted
converted voice command into a voice signal, performing
voice recognition of the voice signal, and providing an
action routine to the control device in accordance with
the voice recognition performed on the voice signal.

7. A method of controlling in accordance with
claim 6, wherein converting the voice command comprises
frequency modulating the voice command, and converting
the transmitted converted voice command comprises
frequency demodulating the transmitted converted voice
command.

8. A remote control device comprising the
combination of:

means responsive to a voice command for producing a
corresponding analog voice signal;

means for modulating the analog voice signal; and

means for transmitting the modulated analog voice
signal.

9. A remote control device in accordance with
claim 8, wherein the means for producing a
corresponding analog voice signal comprises a
microphone, and further including an audio amplifier
for amplifying analog voice signals provided by the
microphone.

10. A remote control device in accordance with
claim 8, wherein the means for modulating comprises a
voltage controlled oscillator for frequency modulating
the analog voice signal.

11. A remote control device in accordance with
claim 8, wherein the means for transmitting comprises a
driver coupled to an infrared transmitter.

12. A remote control device in accordance with
claim 8, further including a battery and a push-to-talk
switch for selectively coupling the battery to power
the means for producing, the means for modulating and
the means for transmitting.

13. A device for operating in response to a
transmitted audio signal comprising the combination of:

means for sensing a transmitted audio signal;

means responsive to a sensed audio signal for
converting the audio signal into a voice signal;

means responsive to the voice signal for performing
voice recognition thereon; and

means responsive to the voice recognition for
producing an action routine to determine operation of
the device.

14. A device in accordance with claim 13, wherein 
the means for sensing comprises an infrared sensor. 

15. A device in accordance with claim 13, further
including means for amplifying a sensed transmitted
audio signal and means for filtering the amplified
sensed transmitted audio signal, and wherein the means
for converting comprises a phase-locked loop receiver
for frequency demodulating the filtered amplified
sensed transmitted audio signal.

16. A device in accordance with claim 15, further
including an audio filter for filtering a frequency
demodulated output of the phase-locked loop receiver
and an audio amplifier for amplifying an output of the
audio filter to produce the voice signal.

17. A device in accordance with claim 13, wherein
the means for performing voice recognition comprises:

an analog-to-digital converter for converting the
voice signal into an incoming digital voice signal;

a reference memory for storing a plurality of
reference digital voice templates;

a program memory for storing a control program; and

a processor coupled to the converter, the reference
memory and the program memory for generating an
incoming digital voice template from the incoming
digital voice signal, for executing the control program
to determine whether the incoming digital voice
template is substantially equivalent to one of the
reference digital voice templates, and for selecting
one of a plurality of action routines based on the
incoming digital voice template.

18. A device in accordance with claim 13, further
including a television set coupled to be controlled by
execution of the selected one of a plurality of action
routines.

19. A device in accordance with claim 18, wherein
the television set includes means for producing
on-screen displays, in response to execution of the
selected one of a plurality of action routines
requesting a user to transmit a further audio signal.

20. A voice operated wireless remote control
system comprising the combination of:

a remote control device, including means responsive
to a voice command for producing an analog audio
signal, means for modulating the analog audio signal,
and means for wireless transmission of the modulated
analog audio signal; and

a controlled device, including means for sensing a
transmitted modulated analog audio signal, means for
demodulating a sensed signal to produce a voice signal,
means for performing voice recognition of the voice
signal, and means responsive to performance of voice
recognition for providing an action routine to the
controlled device.

21. A voice operated wireless remote control
system in accordance with claim 20, wherein the means 
for wireless transmission comprises an infrared 
transmitter and the means for sensing comprises an 
infrared sensor. 

22. A voice operated wireless remote control 
system in accordance with claim 21, wherein the means 
for modulating comprises a frequency modulator and the 
means for demodulating comprises a frequency 
demodulator.

23. A voice operated wireless remote control
system in accordance with claim 22, wherein the
frequency modulator comprises a voltage controlled
oscillator and the frequency demodulator comprises a
phase-locked loop receiver.

24. A voice operated wireless remote control
system in accordance with claim 23, wherein the means
for producing an analog audio signal comprises a
microphone and an audio amplifier, the means for
sensing a transmitted modulated analog audio signal
comprises an infrared sensor, an amplifier and a
filter, and the means for demodulating a sensed signal
includes an audio filter coupled to the phase-locked
loop receiver and an audio amplifier coupled to the
audio filter for producing the voice signal.